Interactional adequacy as a factor in the perception of synthesized speech
Authors
Abstract
Speaking as part of a conversation is different from reading aloud. Speech synthesis systems, however, are typically developed under assumptions (at least implicit ones) that hold better for reading aloud than for conversation. We address one such assumption: that a fully formulated sentence is available for synthesis. We have built a system that does not make this assumption and can instead synthesize speech from incrementally extended input. In an evaluation experiment, we found that in a dynamic domain, where what is talked about changes quickly, subjects rated the output of this system as more ‘naturally pronounced’ than that of a baseline system that employed standard synthesis, even though its synthesis quality was objectively degraded. Our results highlight the importance of considering a synthesizer’s ability to support interactive use cases when determining the adequacy of synthesized speech.
Similar resources
Comparison of auditory perception level and speech intelligibility after cochlear implantation in prelingual patients with hereditary and non-hereditary profound hearing loss referred to Hazrat Rasoul Akram Hospital
Background & Aim: When the inner ear is disturbed, both hearing sensitivity and selective property decrease. Early rehabilitation for proper progression of speech and language appropriate to age is mandatory. Several studies were performed to compare factors that affect the results of cochlear implantations to select the best candidates on the basis of different criteria. This study was underta...
Metadiscourse Markers Revisited in EFL Context: The Case of Iranian Academic Learners’ Perception of Written Texts
Moving in line with the postulation that metadiscourse (MD) markers help transform a dry and tortuous piece of text into a coherent and reader-friendly one, the researchers in the current study attempted to investigate the effect different metadiscourse markers might have on Iranian EFL learners’ perception of written texts. To this end, 120 undergraduate English students were given three diffe...
The Role of Sociolinguistics in Second Language Acquisition
Learning a new language also involves learning a broad system of norms for social relations. This study broadly showed how EFL learners’ speech act is conveyed from their native cultures when they are communicating in English and demonstrated that there are some possibilities of cross-cultural misunderstanding when interlocutors are engaged in the speech act of complimenting with native speakers of...
Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One of the interesting topics in the multimedia domain concerns enabling computers to produce speech. Speech synthesis grants computers the human ability to produce speech. Data-based and process-based approaches are the two main approaches to speech synthesis. Each approach has its own challenges. Unit-selection speech synthesis and statistical parametr...
Correlation between Auditory Spectral Resolution and Speech Perception in Children with Cochlear Implants
Background: Variability in speech performance is a major concern for children with cochlear implants (CIs). Spectral resolution is an important acoustic component in speech perception. Considerable variability and limitations of spectral resolution in children with CIs may lead to individual differences in speech performance. The aim of this study was to assess the correlation between auditory ...